Intrinsic Data Locality of Modern Scientific Workloads
نویسندگان
چکیده
Understanding the intrinsic data locality of a workload is essential to understanding and predicting cache performance. The intrinsic data locality of a particular application or workload can be measured in a microarchitectureindependent manne,: The data resulting from these measurements ideally can be used to develop an analytic model forpredicting memory performance on different cache sizes and confrgurations. Many studies on data locality use cache hit ratios, a micmarchitecture-dependent metric, to examine locality. In this paper: we present a microarchitecturedependent and a micmarchitecture-independent characterization of the SPEC2000 workloads. We present quantitarive statistics on the diferent types of data locality (e.g.. spatial and temporal) exhibited by these workloads and we show that rhe composite intrinsic locality can be correlated to locality measured by cache hit raria.
منابع مشابه
Quantifying Temporal and Spatial Localities Due to File System Caching
Modern storage consists of many optimizations along the data path, from the front-end file system cache and disk layout management to the back-end disk controller cache and on-disk caching. Various components generally exploit the temporal and spatial localities of a workload to improve performance. However, how localities of a workload are transformed by each optimization is not well understoo...
متن کاملFlexSplit: A Workload-Aware, Adaptive Load Balancing Strategy for Media Clusters
A number of technology and workload trends motivate us to consider a new request distribution and load balancing strategy for streaming media clusters. First, in emerging media workloads, a significant portion of the content is short and encoded at low bit rates. Additionally, media workloads display a strong temporal and spatial locality. This makes modern servers with gigabytes of main memory...
متن کاملFlexSplit: A Workload-Aware, Adaptive Load Balancing Strategy for Media Cluster
A number of technology and workload trends motivate us to consider a new request distribution and load balancing strategy for streaming media cluster. First, in emerging media workloads, a significant portion of the content is short and encoded at low bit rates. Additionally, media workloads display a strong temporal and spatial locality. This makes modern servers with gigabytes of main memory ...
متن کاملAdaptive Request Scheduling for Parallel Scientific Web Services
Scientific web services often possess data models and query workloads quite different from commercial ones and are much less studied. Individual queries have to be processed in parallel by multiple server nodes, due to the computationand data-intensiveness of the processing. Meanwhile, each query is performed against portions of a large, common dataset. Existing scheduling policies from traditi...
متن کاملAccelerating Large Scale Scientific Exploration through Data Diffusion
Scientific and data-intensive applications often require exploratory analysis on large datasets, which is often carried out on large scale distributed resources where data locality is crucial to achieve high system throughput and performance. We propose a “data diffusion” approach that acquires resources for data analysis dynamically, schedules computations as close to data as possible, and rep...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004